--------------------------------------------------------------------------------------------------------------------------------------------README for: INCENTIVES AND CAREERS IN ACADEMIA: THEORY AND EMPIRICAL ANALYSIS, The Review of Economics and StatisticsBy: Daniele Checchi, Gianni De Fraja and Stefano VerzilloQuestions: daniele.checchi@unimi.it / stefano.verzillo@ec.europa.eu / gianni.defraja@notthingam.ac.ukyyyy-mm-dd: 2020-02-11--------------------------------------------------------------------------------------------------------------------------------------------REPLICATION INSTRUCTIONSThis folder contains all source codes used to generate tables and results for the paper "Incentives and Careers in Academia: Theory and Empirical Analysis".  All files ending in .do should be executed in Stata (version 13.0 or higher) and all files ending in .sas/.rtf should be executed in SAS/SQL. The .do files contain standard commentary.The datasets reporting the administrative records of Italian academics (assistant, associate and full professors) since 1991 and their publications in Web of Science are restricted-use datasets for confidential or commercial reasons respectively. To replicate results in the paper, upon having granted access to the same datasets, the following files should be used:(a) editor_corpodocente.rtf     > This file merge miur data on academics (people.dta) with WoK publications (nisi.dta and citations.dta) (b) editor_jcr2012.sas     > Sas editor to generate Journal impact factors from JCR source files (Jcr_2012.csv)(c) Final.xls     > Matrix for disambiguation of homonymies(d) enrolled_students_from_miur.do     > This file generates Immatricolati_Provenienza_Finale.dta from MIUR data to be used in RESTAT_R2.do(e) bibliom.do     > This file generates individual h-index from average time profile of citations accrual in WoK data (citations.dta), estimates an H-index for each professor/year and attach them to individuals (people.dta) generating hindex.dta  (f) datacdv2.do    > This file generates an individual-year panel dataset named datacdv2.dta from MIUR's individual archive (people.dta) and Web of Knowledge data on publications (nisi.dta and hindex.dta)(g) meritp2.do    > This file generates the measures of the orderliness index (Mass2extended.dta) to be used in final analysis (h) creation relevant dataset .do    > This file generates the individual-year panel dataset (datacdv2.dta) to the individual-period final dataset ready for the analysis (regression10.dta) (i) REStat_R1.do    > This file further polishes the final dataset (generating reg_Final.dta from regression10.dta) and then run all regressions and graphs + revisions (1st round) (l) REStat_R2.do    > This is a file which generates additional results requested after the second round of revisions.(m) curvepaperbandw.mw     > maple file for generation of the figures in the theory section of the paper.  --------------------------------------------------------------------------------------------------------------------------------------------DATA INSTRUCTIONSFour data sources are used in this paper. The data sources are described below along with the details on how to require access for research purposes to their data providers:(1) MIUR administrative records: Administrative data on all individuals with a post in an Italian university at any time between 1990 and 2011 from the Italian Ministry of University and Research (MIUR). Information on Italian academics' (given name, surname, university, faculty, national scientific sector) is publicly available at https://cercauniversita.cineca.it for the years from 2000 to 2011. The full version of the administrative dataset also contains age, gender and all academics in post from 1990 to 2000 is not publicly available. This information is necessary to link individuals to papers, and to organise the data in panel format. Access to the complete historical dataset was granted to the authors by the Statistics Department of the Italian Ministry of Education, University and Research � MIUR. Requests to access the same data should be directed to the Statistics Department of the Italian Ministry of Education, University and Research(ufficio.statistico@miur.it). This information should be converted into a Stata dataset called people.dta. The authors are willing, within reasonable time limits, to provide personal contacts within the Ministry offices for officials whose consent may be required for the release of this data.    (2) Thomson Reuters' Web of Knowledge Database: Every article published in the period 1990-2011 where at least one author listed an Italian institution among their affiliations as reported by the web-version of Thomson Reuters Web of Knowledge (WoK) database. The WoK database was accessed via both the University of Milan (Italy) and University of Leicester (UK) subscription licenses between January and June 2012. Publications and citations were then counted at the download dates. To obtain access to the WoK database a valid subscription should be obtained and downloading procedures may be (now) automatised via the WoK XML/API web service (https://clarivate.com/webofsciencegroup/solutions/xml-and-apis/). To get the same number of citations received at the time of our download (Jan-Jun 2012) by the articles collected an "ad hoc" request should be submitted to Thomson Reuters' Web of Knowledge Services, which may charge the user with an additional fee. This information should be converted into two Stata datasets called nisi.dta and citations.dta.(3) The 2012 Journal Citation Report: also from Thomson Reuters, it provides impact factor measures for every journal, as well as the research areas where each journal belongs. This dataset is available via a valid academic subscription (at http://help.incites.clarivate.com/incitesLiveJCR/JCRGroup/jcrHomePageExport.html). This information should be converted into a Stata dataset called jcr2012.dta.  (4) Data on students enrolled in Italian universities are collected annually by the Statistical Office of the Italian Ministry of Education, University and Research (MIUR) for administrative purposes. MIUR provides information on all flows of first-time enrolments for each degree course run by Italian universities. The dataset for each year can be freely downloaded from http://dati.ustat.miur.it/dataset/immatricolati and should be harmonised over time before constructing a panel dataset (see enrolled_students_from_miur.do). This information should be converted into a Stata dataset called Immatricolati_Provenienza_Finale.dta. While these instructions should suffice to replicate the results, the authors realise that this would be a complex operation, and that guidance may be needed. We will therefore respond to reasonable requests from researchers who, having obtained the necessary data, may require assistance in the replication of our results.--------------------------------------------------------------------------------------------------------------------------------------------